Method and apparatus for scaling input bandwidth for bandwidth allocation technology

ABSTRACT

A method and apparatus are provided for scaling an input bandwidth for bandwidth allocation technology. An original bandwidth count value of an input flow is received. A bandwidth scalar constant is provided and used for scaling the received original bandwidth count value to provide a scaled bandwidth value between zero and one. The scaled bandwidth value is stored and used for calculating a transmit probability for the input flow.

FIELD OF THE INVENTION

The present invention relates generally to the data processing field, and more particularly, relates to a method and apparatus for scaling an input bandwidth for bandwidth allocation technology.

DESCRIPTION OF THE RELATED ART

FIG. 1 illustrates a conventional switch or router used in a computer network. Congestion avoidance is a function that is provided in order to help computer networks deal with periods of time when there is not enough bandwidth to send all messages between computer systems. Congestion occurs, for example, if a router connects more than one input systems to more than one output system and all input systems are trying to communicate with a single output line, as shown in FIG. 1.

In FIG. 1, if inputs one through five are trying to send messages to one output line, such as output four, and if buffering capability of the router cannot hold all of the incoming messages, then some of the incoming messages will need to be dropped.

The simplest but least desirable is to tail drop meaning that frames are dropped when the buffer is full. All frames in a row are dropped until the buffering is freed up enough so that the router can begin receiving.

The preferred method of handing such over subscription is to randomly drop packets according to some drop probabilities. Various methods have been proposed over the years to deal with drop probabilities. One method still in use today is known as Random Early Discard (RED). By setting system parameters which must be set up before congestion occurs, a drop probability can be created.

Another commonly used and improved method is known as Weighted Random Early Discard. The Weighted Random Early Discard method attempts to provide some improvements over RED by allowing some lines or paths higher priority without dropping packets from multiple sources. Again, a manual approach is used in setting up the system to handle the drop probabilities. Both methods employ threshold monitoring and then a drop rate supplied by the system administrators are applied.

International Business Machines Corporation of Armonk, N.Y. has developed a more recent method called Bandwidth Allocation Technology. The Bandwidth Allocation Technology (BAT) method provides an automatic system to deal with allocating the bandwidth resources of a chip or system based upon the system utilization. The BAT method applies control theory to create an automatic system that reacts to congestion.

FIG. 2 illustrates the more recent algorithm called Bandwidth Allocation Technology together with hardware for implementing dynamic congestion control in the switch or router of FIG. 1. The BAT method uses feedback of the number of bytes transmitted through the router, together with controlling constants, and creates a function that computes the desired Ti, the transmit probability or transmit ratio for a given flow. The BAT algorithm has the characteristics that when congestion occurs, the flow will see a reduction in its transmit probability Ti. When congestion is over, the transmit rate rises back to 100% when no further congestion is encountered.

FIG. 2 provides the formula used to determine Ti, the transmit probability for a given flow as follows: if Fi <=mini then Ti(t+Dt)=min(1,Ti(t)+0.1 25) else if(Fi>maxi) then Ti(t+Dt)=0.857*Ti(t) else if B=1 Ti(t+Dt)=min(1,Ti(t)+Ci*E(t)) else Ti(t+Dt)=max (0, Ti(t)−Di*Fi(t)) The value Fi represents the amount of bytes transmitted for any given line or channel i input to the router.

Ti is the transmit probability for any given input i. Ti(t+Dt) represents the transmit rate for a next period that starts after the computation is performed. Ti(t) represents the previous transmission rate, for what was just being transmitted. The transmit probability Ti is a value between 0 and 1. Mini represents a lower limit or guaranteed level where the transmit probability Ti will not be reduced. The bytes transmitted over a sample time value Fi is compared with the mini value to determine whether the flow is below the lower limit or guaranteed level.

Maxi represents an upper limit or a maximum number of bytes that the channel can transmit. The bytes transmitted over a sample time value Fi is compared with the maxi value to determine whether the flow is above the upper limit or maximum level. If a channel exceeds this number, then the transmit rate for the channel is decreased.

B represents a control signal for excess bandwidth. If the routing device is not seeing congestion, then value B is set to true or one. Packets are allowed to flow in and out without any packets being dropped. However, when congestion is detected, then the value B is set to zero.

Ci represents a constant of increase and is used when an input flow is between minimum and maximum and there is no congestion. Ci is used to allow additional packets into the system. The value E represents a history of excess bandwidth and is a value between 0 and 1. If the router is congested for only a short period of time, E will be close to one. However, if congestion has occurred over a long period of time, E will be near or at zero.

Di represents a discard coefficient or constant of decrease and is used to decrease the amount of packets entering the system during congestion. The value Dt is a complex number taking bytes into a percentage of bandwidth used for a given flow. The discard coefficient Di has a wide range of values containing a scaling value and an exponential value. The multiply of Di*Fi(t) requires a large number of bits and the complexity of an exponent requiring floating point and exponent capabilities.

A need exists for an improved mechanism for implementing bandwidth allocation technology. It is desirable to provide such improved mechanism for implementing bandwidth allocation technology that simplifies calculations, for example, that eliminates the need for a discard coefficient to contain both scaling and exponential value.

SUMMARY OF THE INVENTION

A principal object of the present invention is to provide a method and apparatus for scaling an input bandwidth for bandwidth allocation technology. Other important objects of the present invention are to provide such method and apparatus for scaling the input bandwidth for bandwidth allocation technology substantially without negative effect and that overcome many of the disadvantages of prior art arrangements.

In brief, method and apparatus are provided for scaling an input bandwidth for bandwidth allocation technology. An original bandwidth count value of an input flow is received. A bandwidth scalar constant is provided and used for scaling the received bandwidth count value to provide a scaled bandwidth value between zero and one. The scaled bandwidth value is stored and used for calculating a transmit probability for the input flow.

BRIEF DESCRIPTION OF THE DRAWINGS

The present invention together with the above and other objects and advantages may best be understood from the following detailed description of the preferred embodiments of the invention illustrated in the drawings, wherein:

FIG. 1 illustrates a conventional switch or router that may be used in a computer network;

FIG. 2 illustrates a more recent algorithm called Bandwidth Allocation Technology or BAT together with hardware for implementing dynamic congestion control in the switch or router of FIG. 1;

FIG. 3 is a flow diagram illustrating a method for scaling input bandwidth for bandwidth allocation technology in accordance with the preferred embodiment; and

FIG. 4 is a schematic and block diagram illustrating apparatus for scaling input bandwidth for bandwidth allocation technology in accordance with the preferred embodiment;

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS

Having reference now to the drawings, FIG. 3 illustrates the method for scaling input bandwidth for implementing bandwidth allocation technology in accordance with the preferred embodiment. In accordance with features of the invention, the bandwidth allocation technology (BAT) algorithm is simplified by performing a scaling of an original bandwidth value Fi to provide a scaled bandwidth value fi before performing the three BAT functions using the scaled bandwidth value fi. The scaled bandwidth value fi is used in the compares with the Mini and Maxi values, and is used to decrease the previous transmit probability by the multiplication of Di*fi, where Di is the constant of decrease. Pre-scaling the original bandwidth value Fi to the scaled bandwidth value fi between 0 and 1 allows Dt to be just the desired exponential decrease and the calculation of the transmit probability can be contained in a smaller number of bits without having Di contain an exponent.

First an original bandwidth value Fi, for example, a 32-bit value is applied to a bandwidth scalar 300 of the preferred embodiment and is scaled into a scaled bandwidth value between 0 and 1 represented by a 15-bit value fi. Using the scaled bandwidth value fi, provides hardware simplifications and savings in required memory for storing the constants Mini, Maxi, and Di. The values of Mini and Maxi are scaled and stored as 15-bit values. The Di value is represented by a 15-bit value with no exponent. The scaled bandwidth value fi is applied to an arithmetic logic unit 302 for implementing the simplified BAT method enabled by the preferred embodiment.

As indicated in a < compare function 304, the scaled bandwidth value fi is compared with the Mini value, where the Mini value is the guaranteed or minimum amount of bandwidth that can be used before any traffic will be discarded. The scaled bandwidth value fi is compared with the Maxi value to determine if more bytes transmitted than permissible as indicated in a > compare function 306. The scaled bandwidth bandwidth value fi is multiplied with the discard coefficient Di as indicated in a multiple function 308. The discard coefficient Di is simplified and represented by a 15-bit value with no exponent. The discard coefficient Di multiplication with the scaled bandwidth value fi is simplified without requiring a multiplication of a large number of bits or the complexity for Di including an exponent as required by the prior art implementation.

Referring to FIG. 4, there is shown an exemplary apparatus for implementing bandwidth scalar 300. A Si array 402 contains the count of the number of bytes transmitted on different channels i for a sample period. In the prior art implementation of FIG. 2, these are raw counts are used to compare against the Mini and Maxi values and multiplied against Di. Si array 402 is coupled to a first input of a three-input multiplexer 404. A first Si register 404 is coupled to the output of the multiplexer 404. A BW scalar register 408 provides a scaling constant used to convert the 32-bit value to a 15-bit value between zero and one. The resulting 15-bit scaled bandwidth value fi is easier to use in the ALU 302. ALU 302 does not need to have floating point and exponent capabilities. BW scalar register 408 is coupled to a shift 410 and a multiplexer 412 providing, for example, a shift value (bits 0 to 4) and a fraction, bits 16 to 31. The output of shift 410 is applied an input of the multiplexer 412 and to a second input of the multiplexer 404. The output of multiplexer 412 is applied to an adder 414 providing an adder output. The adder output is applied to an accumulator register 418 via a two-input multiplexer 416. The adder output is applied to a first input of multiplexer 416. Multiplexer 416 has a second zero input for initially loading the accumulator register 418 with a zero value. The accumulator register 418 is coupled to a second input of adder 414 and to a fi array 420 for storing the resulting scaled bandwidth values fi for use by ALU 302 in calculating the transmit probabilities Ti. The fi array 420 provides a previous value of fi to a third input to the three-input multiplexer 404.

By using a second scalar constant in the BW scalar register 408 and using the previous value of fi, bandwidth scalar 300 has additional capability of providing an Exponentially Weighted Averaging of fi, which may be desired where the instantaneous utilization of a link or input i needs to be smoother out. The bandwidth scalar 300 converts Si (what would have been know as Fi) into the scaled fi including the following steps. Optionally the last value of fi is taken from the fi array 420 and the second scalar constant of the BW scalar register 408 is used to scale the previous fi for Exponentially Weighted Averaging of fi. The original bandwidth count value of Si is taken from the Si array 402 and is scaled and added to the optional EWA of fi. The new fi is stored in the fi array 420 for use by ALU 302 in calculating the transmit probabilities Ti.

It should be understood that any loss in precision occurring as a result of the scaling is not significant in view of the calculations otherwise performed. The following Table 1 illustrates exemplary settings for the BW scalar register 408 given a sample time of 2.7 ms. Sampling faster will result in additional precision and different settings.

TABLE 1 Speed Suggested Bandwidth Minimum Name Line Rate Scaler Value Resolution DS1 1.544 Mbps Not suggested N/A DS3 44.736 Mbps 0xd0004572 1,365.38 bps OC-1 51.840 Mbps 0xc00077db 1,582.08 bps SMII 100.0 Mbps 0xb0007c45 8,051.85 bps OC-3 155.250 Mbps 0xb000500c 4,738 bps OC-12 622.080 Mbps 0x90004fc8 18,985 bps GMII 1.00 Gbps 0x8000636b 30,518.5 bps

While the present invention has been described with reference to the details of the embodiments of the invention shown in the drawing, these details are not intended to limit the scope of the invention as claimed in the appended claims. 

1. Apparatus for scaling an input bandwidth for implementing bandwidth allocation technology comprising: a first array for containing a count value of bytes transmitted over each flow input for a sample period; a first multiplexer having a first input coupled to said first array; a first register coupled to an output of said first multiplexer for receiving said count value of an input flow; a scalar constant register providing a scalar constant; a shift coupled to said first register and said scalar constant register providing a shifted output; said shifted output applied to a second input of said first multiplexer; a second multiplexer coupled to said scalar constant register and receiving said shift output and; an adder coupled to an output of said second multiplexer providing an adder output; a second register coupled to said adder output for storing a resulting scaled bandwidth value; said scaled bandwidth value being between zero and one.
 2. Apparatus for scaling an input bandwidth for implementing bandwidth allocation technology as recited in claim 1 wherein said received count value includes a first number of bits and said resulting scaled bandwidth value includes a second number of bits; and said second number of bits is less than said first number of bits.
 3. Apparatus for scaling an input bandwidth for implementing bandwidth allocation technology as recited in claim 1 wherein said received count value includes 32-bits and said resulting scaled bandwidth value includes 15-bits.
 4. Apparatus for scaling an input bandwidth for implementing bandwidth allocation technology as recited in claim 1 includes a second array for storing said resulting scaled bandwidth value for each flow input.
 5. Apparatus for scaling an input bandwidth for implementing bandwidth allocation technology as recited in claim 1 wherein said first multiplexer having a third input coupled to said second array for loading said first register with a previously stored scaled bandwidth value for said flow input.
 6. Apparatus for scaling an input bandwidth for implementing bandwidth allocation technology as recited in claim 5 wherein said scalar constant register providing a second scalar constant for scaling said previously stored scaled bandwidth value and providing an exponentially weighted average scaled bandwidth value.
 7. Apparatus for scaling an input bandwidth for implementing bandwidth allocation technology as recited in claim 5 wherein said adder having a second input for adding said second register scaled bandwidth value and said exponentially weighted average scaled bandwidth value. 